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Abstract. This paper contributes to the solution of the problem of 
(3JT) transforming a process model with an arbitrary topology into an equiv- 

^ alent structured process model. In particular, this paper addresses the 

subclass of process models that have no equivalent well-structured repre- 
sentation but which, nevertheless, can be partially structured into their 
maximally-structured representation. The structuring is performed under 
a behavioral equivalence notion that preserves observed concurrency of 
i tasks in equivalent process models. The paper gives a full characterization 

Pj^ of the subclass of acyclic process models that have no equivalent well- 

structured representation but do have an equivalent maximally-structured 
^ one, as well as proposes a complete structuring method. 

o 

1 Introduction 

Process models are usually represented as graphs, where nodes stand for tasks or 
^-j- decisions, and edges encode causal dependencies between adjacent nodes. Com- 

00 mon process modeling notations, such as Business Process Model and Notation 

C*~) (BPMN) or Event-driven Process Chains (EPC), allow process models to have 

almost any topology. Structural freedom allows for a large degree of creativity 
00 when modeling. Nevertheless, it is often preferable that models follow certain 

structural patterns. A well-known property of process models is that of (well-) 
structuredness pQ. A model is well-structured, if for every node with multiple 
outgoing arcs (a split) there is a corresponding node with multiple incoming 
arcs (a join), such that the fragment of the model between the split and the 
join forms a single-entry-single-exit (SESE) process component; otherwise the 



model is unstructured. Fig. 1(a) shows a process model. Each dotted box defines 
a component composed from the arcs that are inside or intersect the box. Split s 
has corresponding join z; together they define SESE component Rl. Yet, split u 



has no corresponding join and, thus, the model in Fig. 1(a) is unstructured. 

The motivations for well-structured process modeling are manifold. Structured 
models are easier to layout, understand, support, and analyze [2j. Consequently, 
some process modeling languages urge for structured modeling, e.g., Business 
Process Execution Language (BPEL) and ADEPT. We advocate for a different 
philosophy: The modeling language should provide process modelers with a 
maximal degree of structural freedom to describe processes. Afterwards, scientific 
methods can suggest (whenever possible) alternative formalizations that are 
better structured, preferably well-structured. 
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(a) (b) 
Fig. 1. (a) Unstructured model, and (b) its equivalent maximally-structured version 

In previous work [2J, we proposed a technique to automatically transform 
acyclic process models with arbitrary topologies into equivalent well-structured 
models. The structuring is accomplished under a strong notion of behavioral 
equivalence called fully concurrent bisimulation |1I2| . As an outcome, the resulting 
well-structured models describe the same share of concurrency as the original 
unstructured models. It was shown in [1] (by means of a single example) and 
confirmed in [2J (for the general case of acyclic models) that there exist process 
models that do not have an equivalent well-structured representation. Fig. 1(a) 
is an example of such a model. Though not completely structurable, this model 
can be partially structured to result in its maximally-structured version shown 



in Fig. 1(b) A process model is maximally-structured, if every model that is 



equivalent with it has at least the same number of SESE components defined 
by pairs of a split and join node as the model itself. Note that Fig. l(b)| us es 



short-names for tasks (a, 6, c. . .), which appear next to each task in Fig. 1(a) 



After the initial investigations in [3], this paper gives for the first time a 
complete solution to the problem of maximal structuring of acyclic process 
models. We characterize the class of acyclic process models which do not have 
an equivalent well-structured representation, but which can, nevertheless, be 
maximally structured; and we provide a complete structuring method. 

The remainder of this paper proceeds as follows: The next section gives 
preliminary definitions. Section [3] discusses the structuring technique proposed 
in The technique is summarized as a chain of transformations. We define for 
the first time the notion of a proper complete prefix unfolding which was sketched 
in [3J and which is essential for obtaining sufficient behavioral information to allow 
maximal structuring. Section [4] devises an extension of the structuring technique 
for maximal structuring of process models that do not have an equivalent well- 
structured representation. Section [5] discusses related work and draws conclusions. 

2 Preliminaries 

Preliminaries describe formalisms that will be used later to convey the findings. 
2.1 Process Models and Nets 

This section introduces all subsequently required notions on process models. 

Definition 1 (Process model). A process model P = (A,G,C,type,A, fi) 
has a non-empty set A of tasks, a set G of gateways, An G = 0, and a set 
C E (A u G) x ( A u G) of control flow arcs of P; type : G -*■ {xor, and} assigns to 
each gateway a type; A^> A assigns to each task a name from A + 0. 
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AuG are the nodes of P; a node x e A u G is a source (sink), iS »x = (x» = 0), 
where »x (x«) stands for the set of immediate predecessors (successors) of x. We 
assume P to have a single source and a single sink task; every node of P is on 
a path from source to sink. Each task a e A has at most one incoming and at 
most one outgoing arc, i.e., \»a\ < 1 a |a»| < 1. Each gateway g e G is either a split 
(\»g\ = 1 a \g»\ > 1) or a jom (|»g| > 1 a \g»\ = 1). The semantics of process models 
is usually defined by a mapping to Petri nets. 

Definition 2 (Petri net). A Petri net, or a net, N = (P, T, F) has finite disjoint 
sets P of places and T of transitions, and the /Zoui relation P £ (P x T) u (T x P). 
A wet system (N,M) is a net AT with a marking M : P -*■ No assigning each 
pePa number M(p) of tokens in place p; Mq denotes the initial marking. 

For a node a; e PuT, = {y | (y, x) e P} is a preset, whereas x* = {y \ (x, y) e P} 
is a postset of x; Min(N) denotes the set of nodes of N with an empty preset. 
For X £ PuT, let »X = Uzex •£ and X» = U X eX x *- F° r a binary relation P (e.g., 
P or C) , R + and R* denote irreflexive and reflexive transitive closures of P. 

A net N = (P, T, F) is free-choice, iff V p e P, |p • | > 1 : = {p}. A fo&efed 

net = (P, P, P, T, A) has a function A : P u T -»■ T that assigns each node a 
/a&e/ from T, r e P. If A(i) # r, then < e P is observable; otherwise, t is silent. 

Every process model (Def. [T]) can 
be mapped to a labeled free-choice net 
with a special structure, called WF- 
net [IE]; the net in Fig. [2 



to the model in Fig. 1(a) 




corresponds 
The execu- 
tion semantics of the net (in terms of 
its token game) defines the semantics 
of the model. In our work, we require 
process models to be sound [5], with 
the intuition that a model is sound, iff its corresponding WF-system is sound. 



Fig. 2. A WF-net that corresponds to the 



process model in Fig. 1(a) 



2.2 Unfoldings 

An unfolding of a net system is another net that explicitly represents all runs 
of the system in a possibly infinite, tree- like structure |6|7|8j . In [9] , McMillan 
proposed an algorithm for the construction of a finite initial part of the unfolding, 
which contains full information about the reachable states of a system - a complete 
prefix unfolding. Next, we present main notions of the theory of unfoldings. First, 
we define ordering relations between pairs of nodes in a net. 

Definition 3 (Ordering relations). Let N = (P, T, P) be a net, x,y e PuT. 
o x and y are in causal relation, written x y, iff (x, y) e P + . y and x are in 

inverse causal relation, written y *~n x, iff x -* n y. 
o x and y are in conflict, x #jv £/, iff there exist distinct transitions ti,t2 e P, 
s.t. »t\ n + 0, and (tx,x), (t2,y) e F*. If x x, then x is in self- conflict. 
o x and y are concurrent, x \\n y, iff neither x y, nor y --*jv x, nor x #at y. 
The set Pjv = {"^jv, *^jv, #n, \\n} forms the ordering relations of N. 

Note that in the following we omit subscripts of ordering relations where the 
context is clear. A structure of an unfolding is given by an occurrence net. 
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Definition 4 (Occurrence net). A net N = (B,E,G) is an occurrence net, 
iff : for all b e B holds | • b\ < 1, N is acyclic, for each x e B u E the set 
{y e Bu E \ (y,x) e G + } is finite, and no e e E is in self-conflict. 

The elements of B and E are called conditions and events, respectively. Any 
two nodes of an occurrence net are either in causal, inverse causal, conflict, or 
concurrency relation [6] . An unfolding of a system is closely related to the concept 
of a branching process of a system. A branching process is an occurrence net 
where each node is mapped to a node of the system. 

Definition 5 (Branching process). A branching process of a system S = 
(N,Mq) is a pair f3 = (N',v), where N' = (B,E,G) is an occurrence net and v 
is a homomorphism from N' to N, such that: 
o the restriction of v to Min(N') is a bijection between Min(N') and Mo, and 
o for all ei,e2 e E holds if «ei = «e 2 and v(e\) = v(e-i), then e± = e%. 

The system S is referred to as the originative system of a branching process. A 
branching process can be a prefix of another branching process. 

Definition 6 (Prefix). Let j3i = (N\,V\) and {3<x = (-^21^2) be two branching 
processes of a system S = (N, M ). f3\ is a prefix of (3 2 if N\ is a subnet of A^ 2 , 
such that: if a condition belongs to Nx, then its input event in N 2 also belongs 
to N\ , if an event belongs to N\ , then its input and output conditions in N2 also 
belong to Ni, and v\ is the restriction of v 2 to nodes of N\. 

A maximal branching process of S with respect to the prefix relation is called 
unfolding of the system. Finally, we present a complete prefix unfolding. 

Definition 7 (Complete prefix unfolding). 

Let (3 = (N' , v), N' = (B,E,G), be a branching process of a system S = (N, Mo), 
o A configuration C of j3 is a set of events, C £ E, such that: (1) e e C implies 

that for all e' e E, e' ~* e implies e' e C, i.e., C is causally closed, and (2) for 

all ei,e2 e C holds -.(ei #62), i.e., C is conflict-free, 
o A local configuration of an event e € E, denoted by [e], is the set {e' e E | 

e' e 2?, e' ~» e}, i.e., the set of events that precede e. 
o A set of conditions of an occurrence net is a co-set if its elements are pairwise 

concurrent. A maximal co-set with respect to inclusion is a cut. 
o For a finite configuration C of (3, Cut(C) = (Min(N') u C») \ *C is a cut, 

whereas v{Cut{C)) is a reachable marking of S 1 , denoted by Mark(C). 
o /3 is complete if for each reachable marking M of 5 there exists a configuration 

C in (3, such that: (1) Mark(C) = M, i.e., M is represented in /3, and (2) for 

each transition t enabled at M in N, there exists a configuration C u {e} in 

/?, such that eiC and ^(e) = t. 
o An adequate order < is a strict well-founded partial order on local configura- 
tions, such that [e] c [e'] implies [e] < [e'], where e,e' e E. 
o An event e € E is a cutoff event induced by <, iff there exists a corresponding 

event e' e E, or corr(e), such that Marfc([e]) = Marfc([e']) and fe'] < [e]. 
o (3 is the complete prefix unfolding induced by <, iff f3 is the greatest prefix of 

the unfolding of S" that contains no event after a cutoff event. 

The complete prefix unfolding is obtained by truncating the unfolding at events 
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where the information about reachable 
markings starts to be redundant. Fig. [3] 
shows a complete prefix unfolding of 
the system in Fig. [2| In the prefix, 
event e w is a cutoff event, whereas 
event e v is its corresponding event; this 
relation is visualized by a dotted arrow. 
We write c x , c' x , c", . . . for conditions 
that are the occurrences of place p x ; 
correspondingly for events. The size of 




Fig. 3. A complete prefix unfolding of the 
system in Fig. [2] 




Fig. 4. Structuring chain, cf., [2] 



the prefix depends on the "quality" of the adequate order used to perform the 
truncation. It has been shown that the adequate order proposed in [lOj results in 
more compact prefixes as compared to the one in [S]. 

3 Structuring 

This section discusses the technique for structuring acyclic process models, 
presented in [2J. We elaborate further on the technique by proposing the notion 
of a proper complete prefix unfolding for the first time; we will see that this 
prefix is essential to achieve maximal structuring. 

Fig. [3] shows a chain of phases that 
collectively compose the structuring 
technique. The process model is de- 
composed into a hierarchy of process 
components. Each component is a pro- 
cess model by itself and either well- 
structured or unstructured. An unstructured process component can in some 
cases be transformed into a well-structured one. For this purpose, the compo- 
nent is translated into a workflow system for which the ordering relations of 
its tasks are derived from its proper complete prefix unfolding. If the ordering 
relations have certain properties, the unstructured component can be replaced by 
a well-structured hierarchy of smaller components that define the same ordering 
relations. In the following, we present each phase of the structuring in detail, 
whereas in the next section we extend the technique to allow maximal structuring. 

From process models to unfoldings. Fig. [5] shows a process model that will 
be used in this section for explaining the structuring technique. We employ the 
Refined Process Structure Tree (RPST) [11I12| to learn its structural character- 
istics. The RPST is built from four kinds of process components: A trivial (T) 
component consists of a single flow arc. A polygon (P) represents a sequence of 
components. A bond (B) stands for a set of components that share two common 
nodes - an entry and exit. Any other component is a rigid (R). A component is 
canonical, iff it does not overlap (on edges) with any other component. The set of 
all canonical process components forms a hierarchy that can be represented as a 
tree - the RPST. The parent of a process component is the smallest component 
that contains it. The root of the RPST captures the whole process model, and a 
leaf of the RPST is a flow arc. The dotted boxes in Fig. [5] indicate the components 
and their hierarchy, e.g., PI is a polygon which consists of trivial components 
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(i,s), (z,o), and rigid Rl. Observe that we do not explicitly visualize simple 
components, i.e., trivials and polygons composed of two trivials. 

Polygons and bonds correspond to 
sequences and well-structured compo- 
nents of mutually-exclusive or con- 
current threads. Therefore, a pro- 
cess model is well-structured, iff its 
RPST contains no rigid components. 
A process model can be structured by Fi §- 5 - A process model 

traversing its RPST bottom-up and replacing each rigid component by its equiv- 
alent well-structured component. The difficult step is to find this equivalent 
well-structured component. 

The key idea of structuring is to refine a rigid component R, i.e., a node of 
the RPST, by a subtree of well-structured RPST nodes which define the same 
behavioral relations between i?'s children. The first step when structuring a rigid 
component is to compute the ordering relations of its child nodes. We obtain these 
by constructing a complete prefix unfolding of R's corresponding WF-system. 
The complete prefix unfolding captures information about all reachable markings 
of the originative system, but has a simpler structure, i.e., it is an occurrence net 
(Def. [4]). To capture all well-structuredness contained in R, the complete prefix 
unfolding must have a specific shape called proper. 

Definition 8 (Proper complete prefix unfolding). Let f3 = (N' ,v), N' = 
(B,E,G), be a branching process of an acyclic system S = (N,Mq). 
o A cutoff event e e E of ft induced by an adequate order < is healthy, iff 

Cui([e]) \ e» = Cut(\corr(e)]) s corr(e)». 
o P is the proper complete prefix unfolding, or the proper prefix, induced by 
an adequate order <, iff j3 is the greatest prefix of the unfolding of S that 
contains no event after a healthy cutoff event. 

Fig. [6]shows a proper prefix of the sys- c - e " c -- 

tern that corresponds to the rigid com- 
ponent Rl in Fig. [5] A proper prefix 
contains all information about well- 
structuredness, i.e., all paired gate- 
ways of splits and joins, in a rigid in 
the following way. j3 represents each 
xor split as a condition with multiple 6 - A proper com P lete P refix unfolding 

post-events; each xor join is identified by the post-conditions of a cutoff event e 
and its corresponding event corr(e), e.g., c u and c' u in Fig. [BJ The notion of a 
cutoff event guarantees that f3 contains every xor split and join. An important 
observation here is that corresponding pairs of xor splits and joins are always con- 
tained in the same branch of j3. An and split manifests as an event with multiple 
post-conditions in j3, whereas an and join is an event with multiple pre-conditions. 
The healthiness requirement on cutoff events ensures that concurrency after an 
and split is kept encapsulated, i.e., if several concurrent branches are introduced 
in the unfolding they are not truncated until the point of their synchronization, 
i.e., the and join. Such an intuition supports our goal to derive a well-structured 
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process model, as bonds of a process model that define concurrency must be 
synchronized in the same branches of the model where they originated. 

A proper complete prefix unfolding of an acyclic system is clearly finite. 
For structuring purposes, when computing a proper prefix, we use an adequate 
order proposed in |10j . This adequate order results in minimal complete prefix 
unfoldings for safe systems, if one only considers information about reachable 
markings induced by local configurations, which is the case for healthy cutoff 
events. Thus, the adequate order from fTTj] yields a minimal proper complete 
prefix unfolding of a safe acyclic system, which applies to our case as sound 
free-choice nets are safe [13] , 

From unfoldings to graphs. The proper complete prefix unfolding of a process 
component R contains all ordering relations of all children of R in the RPST. 
For restructuring, R (an RPST node) is to be refined into a subtree along these 
ordering relations. The refinement requires this information to be preserved in a 
hierarchically decomposable form: an ordering relations graph. 
Definition 9 (Ordering relations graph). 

Let (3 = (N' , is), N' = (B, E, G), be a proper complete prefix unfolding of a sound 
acyclic free-choice WF-system S = (N, Mi), N = (P, T, F, T, A). 

o Two nodes x and y of N' are in proper causal relation, denoted by x >* y, 
iff (x, y) e G + or there exists a sequence (ei, . . . , e n ) of proper cutoff events 
of /3, e, € E, 1 < i < n, n e N, such that (x, ei) € G*, (corr(e n ),y) e G + , and 
(corr(ei), e^+i) € G* for 1 < i < n. We denote by <-< the inverse of ». 

o Let TZ = {~^,*^,#,||} be the ordering relations of N'. The proper conflict 
relation of N' is El = # \ (>-»• u •«). The set TZ' = <-<, ffl, ||} forms the proper 
ordering relations of N' . 

o We refer to TZ as observable (proper) ordering relations, iff the relations in TZ 
only contain pairs of events that correspond to observable transitions of N. 

o Let TZ = {»,•*-<, EB, ||} be the observable proper ordering relations of N'. An 
ordering relations graph Q = (V, A,B,o~) of N' has vertices V £ E defined by 
events of f3 that correspond to observable transitions of N, arcs A = » u EB, 
and a labeling function a:V^B,B = T s ~ {t} with cr{v) = A(^(u)), v e V. 






(a) (b) (c) (d) 

Fig. 7. (a),(c) Ordering relations graphs, and (b),(d) the modular decomposition trees 
An ordering relations graph of a process component R captures minimal and 
complete information about the ordering relations of events that correspond to 



observable transitions of a system. Fig. 7(a) visualizes the ordering relations graph 



of the proper complete prefix unfolding in Fig. [6] The proper causal relation 
updates the causality relation of the prefix f3 to overcome the effect of unfolding 
truncation, e.g., a » d, b » f, d >* e, etc. Fig. |7(a)| denotes that a and c are in 
proper causal relation, a and b are in proper conflict, whereas e and / are concur- 
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rent. Figures [7(c) and |7(d)| show the graph and its MDT of the model in Fig. 1(a) 



From graphs to process models. The ordering relations graph not only en- 
codes the ordering relations, it also inherits all information about well-structured- 
ness from the proper prefix, i.e., pairing of gateways is preserved. The structuring 
technique in [5] proceeds by parsing the graph into a hierarchy of subgraphs that 
encode ordering relations of well-structured components. The thereby discovered 
hierarchy of subgraphs is then used to refine a rigid component into a subtree. As 
shown in [2] , each subgraph corresponds to the notion of a module of the modular 
decomposition of a directed graph [TJ] - thus discovering well-structuredness in 
the relations of an unstructured process component. 

Let Q = (V,A,B,a) be an ordering relations graph. A module M £ V in Q is 
a non-empty subset of vertices of Q that are in uniform relation with vertices 
V \ M, i.e., if v e V \ M, then v has directed edges to all members of M or to 
none of them, and all members of M have directed edges to v or none of them 
do. However, vi, v 2 e V \ M, v\ + u 2 can have different relations to members of 
M. Moreover, the members of M and V \ M can have arbitrary relations to 



each other [M]. For example, {e, /} is a module in Fig. 7(a) Two modules Mi 
and M 2 of Q overlap, iff they intersect and neither is a subset of the other, i.e., 
Mi \ M 2 , Mi n M 2 , and M 2 \ Mi are all non-empty. Mi is strong, iff there exists 
no module M 2 of Q, such that Mi and M 2 overlap. The Modular Decomposition 
Tree (MDT) of Q is a set of all strong modules of Q. The modular decomposition 
substitutes each strong module of Q by a new vertex and proceeds recursively. 
The result is the MDT which is a canonical rooted tree and unique. 

Now, a rigid process component R of an RPST can be restructured by refining 
R in the RPST to a subtree Tr. The root of Tr is child of i?'s parent, each child 
of R is attached to a leaf of Tr, the nodes of Tr are defined by the modules of 
the MDT of R's ordering relations graph. The type of a node of Tr is determined 
by the characteristics of its defining MDT module, as follows. 

We refer to singletons of V as the trivial modules of Q. Let M be a non-trivial 
module. M is complete (C), iff the subgraph of Q induced by vertices in M is 
either complete or edgeless. If the subgraph is complete, then we refer to M as 
xor complete. If the subgraph is edgeless, then we refer to M as and complete. 
M is linear (L), iff there exists a linear order {xi, . . . ,x\m\) °f elements of M , 
such that there is a directed edge from X{ to Xj in Q, iff i < j. Finally, if M 
is neither complete, nor linear, then M is primitive (P); a primitive module is 
concurrent iff it contains a pair of vertices that are not connected by an edge. 



Fig. |7(b)| shows the MDT of the graph in Fig. 7(a)| Besides the trivial modules, 



the MDT contains linear LI, and complete CI, and primitive PI, Module LI is 
the root module, whereas trivial modules are leafs of the MDT. 

An acyclic process model has an equivalent well-structured model, if its 
ordering relations graph contains no concurrent primitive module. According to [2], 
behavior captured by other module classes can be expressed by well-structured 
process components. A trivial module corresponds to a task. A linear module 
corresponds to a polygon component. An and (xor) complete module corresponds 
to a bond with and (xor) gateways as entry and exit nodes. A primitive module 
without concurrency can be restructured using standard compiler techniques [15] . 
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Given all of the above, Alg. [I] summarizes the structuring technique. 

Algorithm 1: Structuring Acyclic Process Model (Component) 

Input: An acyclic process model (component) W 

Output: A well-structured process model that is equivalent to W 

Construct WF-net N that corresponds to W 
Construct proper complete prefix unfolding /3 of (A, M;) 
Construct ordering relations graph Q of j3 
Compute M - the MDT of Q 

II Construct process model P by traversing M in postorder 
foreach node m of M do 

if m is trivial then Construct a task 

if m is and complete then Construct an and bond component 
if m is xor complete then Construct a xor bond component 
if m is linear then Construct a trivial or polygon component 
if m is primitive without concurrency then 

] Construct a well-structured process component using compiler techniques 
else FAIL 

return P 

Alg. [I] traverses the MDT of an ordering relations graph of a rigid process 
component and constructs for each encountered module a process component 
from components that correspond to its child modules. The resulting hierarchy 
of components is the subtree that refines the rigid component. 
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(a) (b) 

Fig. 8. (a),(b) Process models that are equivalent with the process model in Fig. [5] 

Fig. [8] shows two process models that are equivalent with the process model in 
Fig. [5] The model in Fig. |8(a)| is obtained by constructing process components that 
correspond to modules of the MDT in Fig. |7(b)| Here, polygon PI corresponds to 
linear LI, bond Bl to and complete CI, and rigid Rl to primitive PI. The model 
in Fig. |8(b)| is obtained from Fig. 8(a) by structuring rigid Rl. The structuring 
can be achieved by employing ID-0 transformation rule from |15j . 



4 Maximal Structuring 

Recall from Sect. [I] that a process model is maximally structured iff every 
equivalent model has the same number of process components defined by pairs 
of splits and joins as the model itself. In the light of Sect. |3j the open problem 
is to obtain a maximally structured process component R. R has this property 
iff (1) all primitive modules in the MDT of R's ordering relations graph are 
concurrent, and (2) there exists a bijection between non-singleton modules of the 
MDT and non-trivial components of the RPST which assigns to each primitive 
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module a rigid component, to each complete a bond, and to each linear a polygon. 
The maximal structuredness of R follows from the maximality of the modular 
decomposition: the ordering relations graph of R inherits all information about 
well-structuredness from the proper complete prefix of R, and the MDT maximizes 
modules with a well-structured representation because of the decomposition into 
strong modules. If a primitive module M with concurrency has well-structured 
child modules, then these modules are maximal again within M. Only the relations 
within M have no structured representation as a process model, where M is 
minimized by maximizing structuredness around and inside M . This yields a 
technique for maximal structuring: one must be able to synthesize a process 
component that exhibits the ordering relations described in M. Such a technique 
would allow to define unstructured process model topologies when mapping 
hierarchies of modules onto hierarchies of process components in Alg. [T] e.g., 
primitive modules in Fig. |7(b)] and Fig. |7(d)] onto rigid components in Fig. |8(a)' 



and Fig. 1(b) The resulting process model would be maximally structured. 



Proper \\ Vl Prime U \V \V Maximally- 
Process \\ mc . = „ =tam= \\ complete \\ re | a j™^ algebraic % Event Vi Occurrence V> Nets * structured 
prefix II II coherent n structures u nets n u process 
unfoldings II u posets II II II II models 



Fig. 9. An extension of the structuring chain of Fig. [4] 

In this section, we propose a solution to the synthesis problem, i.e., given an 
ordering relations graph (a module of an MDT) we synthesize a process model 
(a component of the RPST) that realizes the relations described in the graph. 
The solution consists of several phases that employ the results on translations 
between the languages of domain and net theory [TB], and on folding prefixes of 
systems [17] . Figj9] shows an extension of the structuring approach which was 
proposed in Fig. |4[ Next, we discuss each phase of the extension in detail. 

From graphs to partial orders. This section describes a translation from an 
ordering relations graph to a partial order of information. The partial order is an 
alternative formalization of the meaning of the behavior captured in the graph. 



The ordering relations graph in Fig. 10 is the running 

example of Sect. [4] The graph is a primitive module with all •• Q0 Xj-Q \ 
types of relations; d and d! are events with the same label. C~s*'\/ V-\' 

First, we give some definitions from the theory of partially '-.vf/* /\ 
ordered sets (posets) [16]. Let (-D,-) be a poset. For a subset \ / 
X of D, an element y e D is an upper (lower) bound of X, V 
iff x E y (1 3 y), for each element x e X. An element y e D Fig. 10. An order- 
is a greatest (least) element, iff for each element x e D holds ing relations graph 
x E y An element y € D is a maximal (minimal) element, iff there exist 

no element x e D, such that y 1= x (x 1= y); D max and D min denote the sets of 
maximal and minimal elements of D. Two elements x and y in D are consistent, 
written x f y, iff they have an upper bound, i.e., x f y -o- 3 z e D : x ^ z a y ^ z; 
otherwise they are inconsistent. A subset X of D is pairwise consistent, written 
X", iff every two elements in X are consistent in D, i.e., X^ <=> V.x, y e X : x f y. 
The poset (-0,!=) is coherent, iff each pairwise consistent subset X of D has a 
least upper bound (lub) uX. An element x € D is a complete prime, iff for each 
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subset X of D, which has a lub uX, holds that x != uX =^ 3 y e X : x ^ y. Let 
P = (D, != ) be a poset. We write *Pp for the set of complete primes of P. The poset 
P = (D, 9 ) is prime algebraic, iff *}3p is denumerable and every element in D is the 
lub of the complete primes it dominates, i.e., V x e D : x = u{y \ y e typ ai/E x}. 
A set S 1 is denumerable, iff it is empty or there exists an enumeration of S that is 
a surjective mapping from the set of positive integers onto S. 

The behavior captured in an ordering re- „ a b d e o} 

lations graph can be given as a partial or- f 
der of information points. Similar to |16j . < ' 't p ' t <'' a ' c ' d '°> 

the information points are chosen to be left- {a / e} {a ^ d} {a c d . } {i a ( e} {i , a ,b,d H i,a,i,d'> 
closed and conflict-free subsets of vertices of j \ tit *\ T t 
the graph. Each such set captures the his- < b ' e > < a < b > < a ' c > <'^ b > 
tory of events of some run of a system. Let T/ T,/ T T/ \/ T 

. {b} {a} {c} {i,b} {i,a} {i,c> 

Q = (V,A,B,(j) be a graph and let W be a 

subset of V. W is conflict-free, iff V Ui,U2 e { 1 } 

W : («i,W2) ^ Av («2,wi) £ A. W is left-closed, 

iff V «i e W V v 2 e V: (U2,«i) e j4a («i,w 2 ) £ ( a ) ( D ) 
^4 => u 2 e W . We define £[£] as the partial 

order of left-closed and conflict -free s ubsets Fi S- n - ( a ) Poset > and ( b ) augmen- 

of V, ordered by inclusion. Fig. |TT(a)1 shows ted P oset ° btained from Fi S- M 



1 inspired by Thm. 8 in |16j . characterizes the 



C of the graph in Fig. 10 Thm. 
posets £[G]- 

Theorem 1. Let Q = (V,A,B, a) be an ordering relations graph. Then, C[G] = 
(H, c) is a prime algebraic coherent partial order. Its complete primes are those 
elements of the form [v] = {v' e V | (v',v) € B*}, where B = {a e A | oT 1 £ A}. 

Proof. Let X £ H be pairwise consistent. Then, uX is conflict-free. uX = uX and, 
hence, C[G] is coherent. Each [v], v e V, is clearly left-closed and conflict-free. 
Let X c H have lub uX. X is pairwise consistent and uX = uX. Each [v] is a 
complete prime. If [v] £ uX, then v e uX and for some x e X holds vex and, 
thus, [v] c x. It holds for each X e H that X = u{[v] | v e uX}. Thus, each 
element of C[G] is a lub of the complete primes below it. □ 
Given an ordering relations graph, one can construct C[G] = (H,Q) iteratively. 
Let h\ and hi be subsets of V, such that hi \ h% = {v}. Then fti,ft2 e H, iff 
h 1 = or 3 a e h 1 : (v,a) £ A, and V 6 e hi ■ (b, v) £ A v (v,b) £ A, and 
V ce V\ ft-i, (c,u) 6A,(w,c)M3(i6/i 1 : (c,d), (d,c) e A. 

Let £[5] = (H,c.) be a partial order of an ordering relations graph. We 
augment C[G] with two fresh events i, o £ V. These events are designed to ensure 
the existence of a single source and single sink. An augmented partial order of 
G is C*[G] = (H*,£), where H* = u {ft u {i} \ h e H} u{hu {i,o} \ he H max }. 



Fig. 11(b) shows C* of the graph in Fig. [TOj After adding the minimal and 
maximal elements, the topology of posets stays unchanged, so C* is a prime 
algebraic coherent poset. 

From partial orders to event structures. The next transformation step 
deals with translating partial orders to event structures. Event structures are 
intermediate concepts between partial orders and occurrence nets. The use of 
this intermediate concept was extensively studied in |16j . 
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Definition 10 (Labeled event structure). 

An event structure is a triple £ = (E,<,®), where E is a set of events, < is a 
partial order over E called the causality relation, and ffi is a symmetric and 
irreflexive relation in E, called the conflict relation that satisfies the principle 
of conflict heredity, i.e., Vei, e2, e^ e E : e\ > e2 ffi e% =>• e\ ffi e^. A labeled event 
structure £ = (E, <, ffi,C, k) additionally has a set C of labels, t eC, and n: E -> C 
assigns to each event a label. 

An ordering relations graph Q differs from an event structure £ in that Q allows 
violations of conflict heredity. These violations, however, are not harmful; they 
express equivalent runs of a system. These equivalent run are visible in posets 
and become explicit in event structures. A formal procedure for obtaining an 
event structure from a graph can be intuitively understood as unfolding of the 
graph. Next, we define a construction of an event structure from a poset. The 
definition is an extension of Def. 18 in [TB] ; it incorporates propagation of labels 
of an originative ordering relations graph to the corresponding event structure. 

Definition 11 (Event structure of partial order). 

Let Q = (V, A, B, a) be an ordering relations graph and let P = (H,c.) be an 
(augmented) prime algebraic coherent partial order of Q. Then, V[P] is defined 
as the labeled event structure (E,<,®,C, n), where E = typ, < is £ restricted to 
*Pp, for all e\, e<i e *}3p : e\ ffie2, iff e.\ and e2 are inconsistent in P , and C = Bu{t}. 
Let e e E, and define e as e € e \ U aC e,ae_ff a - Then, ft(e) = er(e), if e e V; otherwise 
n(e) = t, for all e e E. 



Fig. [12(a)1 visualizes V[L*[G]] for the 
graph Q of Fig. [lOj Events are com- 
plete primes of £*[<?] (see in bold- 
face in Fig. [TT] and next to vertices 
in Fig. |l2| ). Directed edges encode 
causality (transitive dependencies are 
not shown), dotted edges represent im- 
plicit concurrency, whereas an absence 
of an edge hints at a conflict relation. 
The event structure in Fig 




12(a) 



is 



(a) (b) 
Fig. 12. Event structures obtained from (a) 



Fig. 11(b) and (b) poset of Fig. 7(a 



structurally similar to the graph in 
Fig. [TO} they differ only in relations 
with fresh i,o events. In general, event structures tend to have a different struc- 
ture compared to the originative graphs. For instance, Fig. |12(b)| shows the event 
structure derived from the augmented poset of the graph in Fig. |7(a) 

From event structures to occurrence nets. Nielsen et al. in [TB] show a 
tight connection between event structures and occurrence nets. Let = (B, E, G) 
be an occurrence net. Then, £[N] = (E,G* r\E 2 ,4t 1 N n£ 2 ) is a corresponding 
event structure. The next theorem, borrowed from |16j . defines the construction 
of an occurrence net from an event structure. 

Theorem 2. Let £ = (£*,<, ffi), E + 0, be an event structure. Then, there exists 
an occurrence net rj[£], such that £ = £[?7[£ ]]• 

Proof. Define the set CE = {x £ E \ Vex, e 2 6 x : e\ + e-i => e\ # e^}- The events 
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Fig. 13. Occurrence net obtained from Fig. 12(a) by Thm. [2] 

of n[£] are exactly those in E. The set of conditions is defined by B = {(e, x) | e e 
E, x e CE, and Ve' e x : e < e'} u {(0, x) \ x e CE, and x + 0}. The flow relation 
is defined by G = {((e,x),e r ) \ (e,x) e B,e' e xj u {({0,x} ,e') \ (0,x) e B,e' e 
x} u {(e, (e,x)) I (e,x) e J3}. It follows, that n[£ ] is an occurrence net for which 
# = ©, and hence £[f?[£ ]] = £ . □ 

Fig. |13| shows the occurrence net which is constructed from the event structure 
shown in Fig. 12(a) using the principles of Thm.[2j Thm. [2] defines a "maximal" 
construction, cf.. |loj. i.e., the resulting nets tend to contain much redundancy. 
With Def. [I2] we aim at preserving only essential behavioral dependencies. 

Definition 12 (Conditions). Let N = (B,E,G) be an occurrence net. 
o A condition b e B is redundant, iff b» = a 3 b' e B, b + b' : b' e (•&)• or 

•b = A 3 b' e B, b * V : (b* = &'•) a (.6' + 0). 
o A condition b € B is subsumed by condition b' € B,b+ b' , iff •& = »6' a &• c 
o A condition b e B denotes a transitive conflict between events e,e' € E 1 , iff 

3 b' e B,btb' 3 e" e b';e±e" ± e' : »b= »b' /\e' eb»n b' • /\ e eb» /\ e -* N e". 
o Any other condition is required. 

A redundant condition has no pre-event (post- 
event), and is not a pre-condition (post-condition) 
of the initial (a final) event. A subsumed condi- 
tion b always has a sibling b' expressing the same 
constraints for larger set of events; depicted light- 
grey in Fig. [13] A condition b denotes a transitive 
conflict between two events, if an "earlier" con- 
dition b' already denotes this conflict; depicted 




Fig. 14. Simplified occurrence 
net obtained from Fig. |13| 



dark-grey in Fig. 13 All these conditions can be 
removed from the occurrence net without loosing 
information about ordering of events. For our structuring, we remove from an 
occurrence net all redundant and all subsumed conditions, and all transitive 
conflicts which have at least two post-events. Removing these conditions from the 



net in Fig. 13 yields the net in Fig. 14 Note that all conditions are labeled r, 
and that transitive conflicts with one post-event will be needed for the next step. 

From occurrence nets to nets. The simplified occurrence net obtained by 
Thm. [2] and Def. 12 is already a process model - though one with duplicate 
structures and multiple sinks. We obtain a more compact model with a single sink 
by folding the occurrence net. Intuitively, we fold any two nodes of an occurrence 
net which have isomorphic successors into one node. This operation preserves all 
ordering relations and all behavior represented in the net. Folding finite occurrence 
nets succeeds with the following inductive definition of a future equivalence. 
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Definition 13 (Future equivalence, Folded net). Let N = (B,E,G,T,X) 
be a labeled occurrence net, V 6 e B : A(6) = r. An equivalence relation ~y is a 
future equivalence on N, iff there exists an equivalence ~ £ (B x B) u (E x 
o For all 6, b' e B, if 6» = 6'», then 6 ~ b' . 

o For X,y c BuE, write X ~ Y, iff X = . . . , x k }, Y = { Vl ,...,y k }, s.t. 

Xi ~ j/j, for 1 < i < fe; 
o For all x, y e i? u _E, if A(a;) = X(y) and £• ~ y» and ->(x \ \ N y), then x ~ y. 
The future equivalence defines x ~ f y, iS x ~ y a (x, y e E »x ~ »y). 
Let ~/ be a future equivalence on iV; write (cc) / = {y | y ~y a;} for the equivalence 
class of x. Then the folded net of N under ~* is the net iVy = ({(&)/ I & 6 
S}, {(e)/ | e e E),{((x) f , (y) f ) \ (x, y) e F},Xf) with X f ({x) f ) = X(x). 
Considering the occurrence net N in Fig. |14[ the equivalence ~ / with the classes 
{hi,bi5}, {e8,e 9 }, {b 9 ,b n }, {612,^13}, {ee,e 7 }, {b 6 ,b s }, and all other nodes 
remaining singleton, is a future equivalence on N. Folding N under ~ f yields the 









eg, 69 




^% 1 — 1 




S r 











net in Fig. 15 Folding AT into Nj preserves the behavior of N, cf., [T71 Thm. 8.7]. 

Each occurrence net has several future equiv- 
alences differing in how pre-conditions of events 
are folded. A simple algorithm to compute a fu- 
ture equivalence implements the steps of Def. [13] 
and uses branching and backtracking whenever 
for a condition b there are two or more pair- 
wise concurrent conditions that could be folded 
with b. Each option is explored and the most- Fig. 15. Folded net obtained 
compact folding is chosen. For instance, after fr° m Fig- [W] 
folding 614 ~f 615 and e% ~f eg, for 613 the folding options 612 and bg can be 
explored; backtracking yields 612 as the better match for 613 because of their 
d-labeled pre-events. Various heuristics improve exploration and backtracking. 

If the original process model has control-flow edges between gateways without 
any visible activity, folding gets more involved. In this case, the occurrence 
net contains supposedly equivalent events with different numbers of required 
pre-conditions, e.g., es and eg with required pre-conditions {612} and {611,613}, 
respectively. Fortunately, Thm. [2] encodes all possible invisible control-flow edges 
as transitive conflicts with one post-event (grey-shaded conditions in Fig. |14[ ) . 
When extending the future equivalence to pre-conditions of events, a subset of 
these transitive conflicts needs to be taken into account as follows: 

o Pick the largest set B' of required pre-conditions, e.g., 6n and 613. 

o For each 6 e B' , extend the folding equivalence with a required condition or 
a transitive conflicts, e.g., 613 ~f 612, 6n ~f 6g. 

o Finally, remove all transitive conflicts not required in this step, e.g., 6i . 
Applying this procedure on our example yields the folded net shown in Fig. |15| 
without the dashed conditions and arcs. 

From nets to process models. The folding was the second to last step in 
synthesizing a process model from a given ordering relations graph. We obtained 
a Petri net Nt which we now transform into a process model P. 

The initial transition i (final transition o) is mapped to the start (end) node 
of P. Every other transition of Nf becomes a task of P. Gateways of P follow 
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from non-singleton pre- and postsets of nodes of 
Nt . A transition t with two or more pre- places is 
preceded by an and join; two or more post-places 
of t define an and split; the pre- and postsets of 
places define xor splits and joins, respectively; 
and gateways are always positioned closer to the 
task. In our example, e%» defines and split s in 
Fig. |16| b 2 * defines xor split u, defines and Fig. 16. Process model obtain- 
split v, »(e 6 ,e 7 ) defines and join t, and •{b 6 ,b$) ed from Fig. [TBI 
defines xor join x positioned between t and v (and gateways closer to tasks); 
correspondingly for all other gateways. The arc from e 2 to (bg,bn) which was 
obtained from a transitive conflict (Def. |12[ ) results in an important control-flow 
arc from w to y without any task. 

5 Related Work and Conclusion 

In this paper, we addressed the problem of structuring acyclic process models. It 
is well known that any flowchart can be structured |15j . but the same claim does 
not apply for process models comprising concurrency [TJ. Some works have been 
devoted to the characterization of sources of unstructuredness |18ll9j and to 
development of methods for structuring process models with concurrency |2QI21j . 
In [2] , we presented the first full characterization of the class of acyclic process 
models that have an equivalent structured version along with a structuring method. 
The method stops when the input model contains an inherently unstructured 
fragment. This paper completes the approach by providing a method to synthesize 
the fragments corresponding to inherently unstructured parts of the input model. 

Close to our setting, the problem of synthesizing nets from behavioral speci- 
fications has been a line of active research for about two decades [22123] . This 
area has given rise to a rich body of knowledge and to a number of tools, e.g., 
petrify [22J and viptool [23]. Yet, these solutions fail in our setting: petrify 
aims at maximizing concurrency while our synthesis preserves given concurrency, 
viptool synthesizes nets with arc weights, which do not map to process models. 

The approach is implemented in a tool, namely bpstruct, which is publicly 
available at http://code.google.eom/p/bpstruct. The running time of our 
structuring technique is mostly dominated by the time required to compute 
proper prefixes, which for safe systems is 0((\B\/n) n ) [10], where B is the set of 
conditions of the prefix and n is the maximal size of the presets of the transitions 
in the originative system. All other steps can be accomplished in linear time. 
Concerning the extension for maximal structuring, the theoretic discussion in 
this paper implies exponential time and space complexity when constructing 
posets (this is due to our wish to be close to the existing theory). However, in 
practice, given an ordering relations graph one can construct a poset which only 
contains information from the graph, without introducing duplicate events, and 
thus stay linear to the size of the graph. At the theoretical level this requires 
introduction of a concept of a cutoff for posets followed by an adjustment of the 
theories along subsequent transformation steps. The folding step is a reverse of 
unfolding and, thus, in the best case can be performed in the same time. The 
fact that the running time depends on the size of the result, allows introduction 
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of a heuristic to terminate computation if the result gets large, e.g., the event 
duplication factor is larger than two. However, in practice we have never observed 
such a need with our implementation always delivering the result in milliseconds. 
Our ongoing work aims at extending the method to handle models with loops. 
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